A new 2-kbit/s speech coder based on normalized pitch waveform
نویسندگان
چکیده
Speech coding at very low bitrate is useful for purposes such as voice communication over computer networks. However, speech coding at around 2.0 kbit/s is di cult for CELP coders while maintaining a high quality. In this paper, a speech coding model called `normalized pitch waveform' and its quantization scheme are presented, aiming for effective compression coding of the `voiced' speech. Listening tests has proven that an e cient and high quality coding has been achieved at bitrate 2.0 kbit/s, less than half of the FS1016. Furthermore, this paper discusses the disadvantage of the normalized pitch waveform and presents an alternative method of using non-normalized pitch waveform. Encoding of a transitional `mixed' state between the `voiced' and the `unvoiced' state is discussed for further improvements.
منابع مشابه
Design of a toll-quality 4-kbit/s speech coder based on phase-adaptive PSI-CELP
This paper describes the design of a toll-quality 4-kbit/s speech coder based on phase-adaptive PSI-CELP. This adaptation method not only gives pitch periodicity to the random excitation but also synchronizes the basic point of the stored random vector with the pitch phase. We further improve the proposed coder by introducing a backward gain prediction scheme. In subjective evaluation experimen...
متن کاملAn 8 kbit/s ACELP coder with improved background noise performance
This paper describes an 8 kbit/s ACELP speech coder with high performance for both speech and non-speech signals such as background noise. While the traditional waveform matching LPAS structure employed in many existing speech coders provides high quality for speech signals, it has significant performance limitations for e.g. background noise. The coder presented here employs a novel adaptive g...
متن کاملA Pitch Pulse Evolution Model for a Dual Excitation Linear Predictive Speech Coder
This paper introduces a new technique to model the excitation waveform for a linear predictive speech coder The target appli cation is high quality speech coding for rates near kb s Our pitch pulse evolution model decomposes the excitation into two separate but simultaneous signals the evolving pitch pulse com ponent and the unvoiced noise like contribution A number of formulations for decompos...
متن کاملA 16-kbit/s wideband speech codec scalable with g.729
A wideband speech scalable codec is proposed for improving the flexibility in telecommunication networks. This coder is scalable with G.729 (ITU 8-kbit/s standard). Its decoder can process the incoming bitstream at three bit rates (8, 12, and 16 kbit/s) and provide a choice of speech types (wideband and telephone-band). The codec has a split-band structure, where both bands are coded by analysi...
متن کاملA Pitch Pulse Evolution Model for a Dual ExcitationLinear Predictive Speech
This paper introduces a new technique to model the excitation waveform for a linear predictive speech coder. The target application is high quality speech coding for rates near 4 kb/s. Our pitch pulse evolution model decomposes the excitation into two separate but simultaneous signals: the evolving pitch pulse component and the unvoiced, noise-like contribution. A number of formulations for dec...
متن کامل